0704-883-0675     |      dataprojectng@gmail.com

Generic Metadata Handling in Scientific Data Life Cycles

  • Project Research
  • 1-5 Chapters
  • Abstract : Available
  • Table of Content: Available
  • Reference Style: APA
  • Recommended for : Student Researchers
  • NGN 5000

Introduction

This chapter introduces the dissertation by describing its context, the identified challenges, how the chosen challenge was met, the achieved impact, relevant publications, and how the thesis is structured. 1.1 Context In science data is the essential focal point in todays computational and quantitative approaches to scientific knowledge gain. Computational simulations enable far reaching explorations of modeled realities while quantitative methods gather data to improve the understanding of observed phenomena. These methods are increasingly viable only via high-end storage and large-scale High Performance Computing resources with individual requirements dramatically rising. Data throughputs involve gigabytes per second continuously, volumes are of petabyte magnitude, continuous files per second rates are in the double-digit range, and a vast universe of complex data representations exists. The great potential of such data is evident by the current trend of Big Data in science that aims at large-scale information extraction to foster scientific discoveries. This is fundamentally enabled by intelligently handling data and by combining a large variety of information technology methods to so-called data life cycles. In principle, these consist of data sources, systems to manage data as well as compute resources, methods for access rights management, utilization interfaces and data sinks. Scientists are naturally focused on their particular research. Thus, metadata is an essential step forward in the efficiency of use as it enables managing data based on its content instead of location. Via specific data life cycles scientists are freed from the necessity to extensively deal with IT infrastructures while still utilizing them to drive their research by handling their extensive data and computing demands. In this complex technological environment, a plethora of significant challenges presents itself that hinders the advancement of the state-of-the-art in data-driven knowledge gain. 1.2 Challenges Vital challenges in managing data life cycles are manifold. Federated authentication and authorization infrastructures need to be integrated while being mindful of the overall resilience of increasingly complex data life cycles. The increasing numbers of files and data amounts need to be managed by Big Data systems. These in turn need to be efficiently integrated with High Performance Computing resources for analysis which signifies the need for advanced interoperability. Besides automated pre- and postprocessing, the user-friendly creation, and execution of workflows to encapsulate complex analysis procedures need to be supported. Integrated scientific environments need to be provided that hide the underlying complexity while enabling that use. Essential is also the building of trust that an infrastructure delivers 6 1. INTRODUCTION what it promises. Closely connected is moving from a fixed-term build up phase to a sustainable operation phase. As these goals are partly opposing to each other, a effective balance between them needs to be developed for each data life cycle. The dissertation focuses on the major challenge of the organization of large numbers of files in the million range using information about data, so-called metadata. Currently, solutions are often either use case specific or lacking completely, thus, preventing easy access and re-use. Without metadata, users have to remember where an individual file is located. With a large number of files this is inefficient if not impossible. This especially holds true for Big Data use cases with a large number of files with complex content and stored in distributed locations. Currently, significant efforts need to be made to implement even narrowly applicable and pragmatic metadata handling solutions for every new scientific experiment.




FIND OTHER RELATED TOPICS


Related Project Materials

Design and Implementation of AI-Powered Plagiarism Detection in Postgraduate Research in Federal University, Kashere, Gombe State

Background of the study

Plagiarism in academic research, particularly at the postgraduate level, is a growing concern in higher education...

Read more
THE PERCEPTION OF STUDENTS TOWARDS THE INTRODUCTION OF WEB PAGE DESIGN INTO THE OTM CURRICULUM USING FEDREAL POLYTECHNIC NEKEDE, OWERRI AS A CASE STUDY

BACKGROUND OF THE STUDY

The global economy has evolved from its traditional structure into one that is...

Read more
EFFECT OF POPULATION GROWTH ON UNEMPLOYMENT IN NIGERIAN ECONOMY

ABSTRACT

This thesis examines the effect of population growth on unemployment in Nigeria using Autoregressive Distributed Lag Bounds (ARD...

Read more
An evaluation of government interventions in flood control in Ilorin East Local Government Area, Kwara State

Background of the study
Flooding is a recurring natural disaster that severely impacts urban and rural communities. In Ilo...

Read more
A Study on the Effect of Tax Policies on Stock Market Volatility in Nigeria

Background of the Study
Tax policies play a crucial role in shaping the investment climate by affecting both the profitabi...

Read more
An investigation of the effect of financial regulations on rural agricultural banking: a case study of AB Microfinance Bank

Background of the Study

Financial regulations play a crucial role in shaping the operations of rural agricultural banking. AB Microfinanc...

Read more
An evaluation of maintenance charge policy reforms on reducing operational costs in banking: a case study of Stanbic IBTC Bank Nigeria

Background of the Study
Maintenance charge policy reforms have been pivotal in addressing the escalating operational costs...

Read more
THE EFFECT OF TRAINING ON WORKER’S PERFORMANCE IN AN ORGANIZATION

ABSTRACT

A survey research design was used for this study. The survey design was appropriate for t...

Read more
AN ANALYSIS OF REPORT CASES OF CRIMES IN NIGERIA FROM 1995 - 2004

BACKGROUND OF STUDY

Criminality is inherent in human nature and culture. As a result, no civilization can claim to be fu...

Read more
Assessing the Effectiveness of Fact-Checking Mechanisms in Sokoto North Local Government Area, Sokoto State

Background of the Study

Fact-checking has emerged as a critical tool for combating misinformation and safeguarding media...

Read more
Share this page with your friends




whatsapp